Implementation and Performance Analysis of the φ-Failure Detector

نویسندگان

  • Naohiro Hayashibara
  • Xavier Défago
  • Takuya Katayama
چکیده

Failure detection is a fundamental building block for ensuring fault tolerance in distributed systems. However, providing accurate and flexible failure detection in off-the-shelf distributed systems is difficult. Practical solutions to failure detection rely on some adaptive mechanism to cope with the unpredictability of networking conditions. However, while they provide reasonably good accuracy, they also lack the necessary flexibility to provide failure detection as a system-wide service. In particular, traditional solutions take a “one size fits all” approach, which prevents them from simultaneously supporting several distributed applications with very diverse QoS requirements. In this paper, we present a novel approach to adaptive failure detection, called φ-failure detector, which addresses the flexibility issue mentioned above. We describe an implementation, and analyze its behavior over intercontinental communication links during several weeks. Our experimental results show that our failure detector compares well with other known adaptive failure detection mechanisms, with the considerable advantage that it provides virtually limitless flexibility.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A performance study of the conceptual implementation of the GEM-tracking detector in Monte Carlo simulation

  PANDA experiment (antiProton ANnihilation at DArmstadt) is one of the key projects of the future FAIR facilities to investigate the reactions of antiprotons with protons and nuclear targets.   experiment is designed to serve as a completely extraordinary physical potential due to exploiting the availability of cold and high-intensity beams of antiprotons. One of the significant parts of the  ...

متن کامل

The Φ Accrual Failure Detector

Detecting failures is a fundamental issue for fault-tolerance in distributed systems. Recently, many people have come to realize that failure detection ought to be provided as some form of generic service, similar to IP address lookup or time synchronization. However, this has not been successful so far. One of the reasons is the difficulty to satisfy several application requirements simultaneo...

متن کامل

Performance Analysis of Hierarchical Failure Detector

We present a new failure detector implementation. This implementation, a variant of the heartbeat failure detector, is both adaptable and designed for scalability. Its first specificity of our implementation lies in the fact that it is designed as a shared service among several applications by way of an adaptation layer between the failure detector and the application. This layer adapts the qua...

متن کامل

Covariance Analysis of a vector tracking GPS receiver based on MMSE multiuser Detection

In high dynamic conditions, using vector tracking loops instead of scalar tracking loops in GPS receivers is proved as an efficient method to compensate the performance. The Minimum Mean Squared Error detector as a multiuser detector is applied in the vector tracking loop for more reliability and efficiency. The Kalman filter does the two tasks of tracking and extracting the navigation data aft...

متن کامل

Two-ways Adaptive Failure Detection with the φ-Failure Detector

It is widely recognized that distributed systems would greatly benefit from the availability of a generic failure detection service. Such a service can however prove useful only if it can adapt simultaneously to changing network conditions and conflicting application requirements. This paper presents a novel approach to adaptive failure detectors, called φ-failure detectors, which dynamically a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003